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Abstract 

The existence of coronaviruses in bats is unknown until the recent discovery of bat-SARS-CoV in Chinese horseshoe bats and a novel group 1 
coronavirus in other bat species. Among 309 bats of 13 species captured from 20 different locations in rural areas of Hong Kong over a 16-month 
period, coronaviruses were amplified from anal swabs of 37 (12%) bats by RT-PCR. Phylogenetic analysis of RNA-dependent-RNA-polymerase 
( pol ) and helicase genes revealed six novel coronaviruses from six different bat species, in addition to the two previously described coronaviruses. 
Among the six novel coronaviruses, four were group 1 coronaviruses (bat-CoV HKU2 from Chinese horseshoe bat, bat-CoV HKU6 from ricketf s 
big-footed bat, bat-CoV HKU7 from greater bent-winged bat and bat-CoV HKU8 from lesser bent-winged bat) and two were group 2 
coronaviruses (bat-CoV HKU4 from lesser bamboo bats and bat-CoV HKU5 from Japanese pipistrelles). An astonishing diversity of 
coronaviruses was observed in bats. 

© 2006 Elsevier Inc. All rights reserved. 
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Introduction 

Coronaviruses are found in a wide variety of animals in 
which they can cause respiratory, enteric, hepatic and 
neurological diseases of varying severity. Based on genotypic 
and serological characterization, coronaviruses were divided 
into three distinct groups (Lai and Cavanagh, 1997; Ziebuhr, 
2004; Brian and Baric, 2005). As a result of the unique 
mechanism of viral replication, coronaviruses have a high 
frequency of recombination (Lai and Cavanagh, 1997). Their 
tendency for recombination and high mutation rates may allow 
them to adapt to new hosts and ecological niches. 

The recent severe acute respiratory syndrome (SARS) 
epidemic, the discovery of SARS coronavirus (SARS-CoV) 
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and the identification of SARS-CoV-like viruses from 
Himalayan palm civets and a raccoon dog from wild live 
markets in mainland China have led to a boost in interests in 
discovery of novel coronaviruses in both humans and animals 
(Guan et al., 2003; Marra et al., 2003; Peiris et al., 2003; Rota 
et al., 2003; Woo et al., 2004a, 2004b). In 2004, a novel group 
1 human coronavirus, human coronavirus NL63 (HCoV- 
NL63) was reported independently by two groups (Fouchier et 
al., 2004; van der Hoek et al., 2004). In 2005, we described 
the discovery, complete genome sequence, clinical features 
and molecular epidemiology of another novel group 2 human 
coronavirus, coronavirus HKU1 (CoV-HKUl) (Woo et al., 
2005a, 2005b, 2005c, 2005d). Recently, we have also 
described the discovery of SARS-CoV-like virus in Chinese 
horseshoe bats and a novel group 1 coronavirus in large bent¬ 
winged bats, lesser bent-winged bats and Japanese long¬ 
winged bats in Hong Kong Special Administrative Region 
(HKSAR) (Lau et al., 2005; Poon et al., 2005). Others have 
also discovered the presence of SARS-CoV-like viruses in 
horseshoe bats in other provinces in China (Li et al., 2005). 
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Based on these findings, we suspected that a previously 
unrecognized large diversity of coronaviruses is present in bats 
in our locality. To test this hypothesis, we carried out a 
territory-wide coronavirus surveillance study in bats in 
HKSAR. 

In this article, we report the discovery of six different novel 
coronaviruses in 13 bats of six different species in HKSAR. The 
molecular epidemiology of the novel and previously reported 
coronaviruses in bats are also described. 

Results 

Bat surveillance and identification of novel coronaviruses 

A total of 618 nasopharyngeal and anal swabs from 309 bats 
of 13 different species were obtained from rural areas in 
HKSAR (Fig. 1, Table 1). RT-PCR for a 440-fragment in the pol 
genes of coronaviruses were positive in anal swabs from 37 
(12%) of 309 bats. These 37 bats were of five genera and six 
species, distributed over 10 of the 20 different sampling sites 
(Table 1). None of the nasopharyngeal swabs was positive for 
coronaviruses. 

Viral culture 

No cytopathic effect was observed in any of the cell lines 
inoculated with bat specimens. Quantitative RT-PCR using the 


culture supernatants and cell lysates for monitoring the presence 
of viral replication also showed negative results. 

Phylogenetic and other in silico analysis of pol and helicase 
genes of coronaviruses 

The complete pol and helicase genes of the coronaviruses from 
the bats were amplified and sequenced. Phylogenetic analysis of 
both genes revealed the presence of eight different coronaviruses 
(Table 1, Fig. 2). The bat-SARS-CoV that we described recently 
was present in anal swabs of 21 Chinese horseshoe bats (Lau et 
al., 2005). Another coronavirus that we described previously 
(Poon et al., 2005) was detected in anal swabs of three lesser bent¬ 
winged bats (bat-CoV strains 61-1,61-2 and 61-3). 

In addition to these two known coronaviruses, six novel 
coronaviruses were identified from 13 bats of six different 
species. They include a novel coronavirus in two Chinese 
horseshoe bats, a novel coronavirus in four lesser bamboo bats, 
a novel coronavirus in four Japanese pipistrelles, a novel 
coronavirus in a rickett’s big-footed bat, a novel coronavirus in 
a greater bent-winged bat, and a novel coronavirus in a lesser 
bent-winged bat. These six novel coronaviruses were named as 
bat coronaviruses HKU2, HKU4, HKU5, HKU6, HKU7 and 
HKU8 (bat-CoV HKU2, bat-CoV HKU4, bat-CoV HKU5, bat- 
CoV HKU6, bat-CoV HKU7 and bat-CoV HKU8) respectively. 
Phylogenetic analysis of both pol and helicase genes showed 
similar topologies and revealed that bat-CoV HKU2, bat-CoV 



Fig. 1. Map of HKSAR showing locations of bat surveillance indicated by dots. Areas belonging to Shenzhen of mainland China are shaded. Solid dots represent the 10 
locations with bats positive for coronaviruses. Location A is where CoV HKU4-1, bat-CoV HKU4-2 and bat-CoV HKU4-3 were found, location B is where bat-CoV 
HKU4-4 and bat-CoV HKU5-1 were found, and location C is where bat-CoV HKU5-2, bat-CoV HKU5-3 and bat-CoV HKU5-5 were found. 
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Table 1 


Bat species captured and associated coronaviruses in the present surveillance study 


Bats 




Coronaviruses 

Sampling locations for the 

Scientific name 

Common name 

No. of 
bats tested 

No. (%) of bats positive 
for coronaviruses 


positive specimens a 

Cynopterus sphinx 

Short-nosed fruit bat 

2 

0(0) 

— 


Hipposideros 

Great round-leaf bat 

13 

0(0) 

— 


armiger 

Hipposideros 

Pomona roundleaf bat 

18 

0(0) 

— 


pomona 

Miniopterus 

Greater bent-winged bat 

51 

1 (2) 

Bat-CoV HKU7 b 

B 

magnater 

Miniopterus 

pusillus 

Lesser bent-winged bat 

25 

4(16) 

Bat-CoV HKU8 b (n = l) c 
Coronavirus previously 
reported (n = 3) c 

D (Bat-CoV HKU8-1) 

E (Bat-CoV 61-1, 61-2 and 61-3) 

Myotis chinensis 

Greater mouse-eared bat 

8 

0(0) 

— 


Myotis ricketti 

Rickett’s big-footed bat 

23 

1(4) 

Bat-CoV HKU6 b 

H 

Nyctalus noctula 

Noctule bat 

7 

0(0) 

— 


Pipistrellus 

abramus 

Japanese pipistrelle 

14 

4 (29) 

Bat-CoV HKU5 b 

B (Bat-CoV HKU5-1) 

C (Bat-CoV HKU5-2, 

HKU5-3 and HKU5-5) 

Rhinolophus affinus 

Intermediate horseshoe bat 

7 

0(0) 

— 


Rhinolophus sinicus 

Chinese horseshoe bat 

118 

23 (19) 

Bat-CoV HKU2 b (n = If 

Bat-SARS-CoV 

(n = 2\f 

D (Bat-CoV HKU2-1) 

J (Bat-CoV HKU2-2) 

F, H, I (Bat-SARS-CoV HKU3) 

Rousettus 

Leschenault’s rousette 

2 

0(0) 

— 


lechenaulti 

Tylonycteris 

pachypus 

Lesser bamboo bat 

21 

4(19) 

Bat-CoV HKU4 b 

A (Bat-CoV HKU4-1, 

HKU4-2 and HKU4-3) 

B (Bat-CoV HKU4-4) 


a The sampling locations are those depicted in Fig. 1. 
b Novel coronaviruses described in the present study. 
c No. of bats positive for this coronavirus species. 


HKU6, bat-CoV HKU7 and bat-CoV HKU8 are group 1 
coronaviruses, whereas bat-CoV HKU4 and bat-CoV HKU5 
are group 2 coronaviruses (Fig. 2, Table 2). 

For the coronaviruses with complete genome sequences 
available, the G + C contents of the genomes concurred with the 
G + C contents of the pol and helicase genes of the 
corresponding coronavirus (Table 2). The G + C contents of 
the six novel coronaviruses were 0.39, 0.37, 0.41, 0.39, 0.40 
and 0.40 for pol genes and 0.40, 0.38, 0.44, 0.39, 0.40 and 0.42 
for helicase genes respectively. 

Nucleotide polymorphisms in Bat-CoV HKU2, Bat-CoV HKU4 
and Bat-CoV HKU5 

Both strains of bat-CoV HKU2 possessed the same nucleo¬ 
tide sequence for pol and helicase genes. Of the four strains of 
bat-CoV HKU4, bat-CoV HKU4-1, bat-CoV HKU4-2 and bat- 


CoV HKU4-3 possessed the same nucleotide sequence in their 
pol genes, whereas bat-CoV HKU4-4 contained one non-syno- 
nymous substitution, at position 871 [TAT (Tyr) —> CAT (His)], 
and two synonymous substitutions, at positions 384 and 2637, 
compared to the other three strains. All four strains possessed the 
same nucleotide sequence in their helicase genes. Of the four 
strains of bat-CoV-HKU5, bat-CoV HKU5-2, bat-CoV HKU5-3 
and bat-CoV HKU5-5 possessed the same nucleotide sequence 
in their pol genes, whereas bat-CoV HKU5-1 contained two 
non-synonymous substitutions, at positions 2158 and 2160 
[AAA (Lys) —> GAG (Glu)] and 84 synonymous substitutions 
compared to the other three strains. In addition, two peaks (T and 
C) were consistently observed at nucleotide position 1279 of the 
pol gene in bat-CoV HKU5-1, suggesting the presence of quasi¬ 
species. As for the helicase genes, bat-CoV HKU5-2, bat-CoV 
HKU5-3 and bat-CoV HKU5-5 are also more closely related to 
each other than to bat-CoV HKU5-1. There were 62 


Fig. 2. Phylogenetic trees of Pol (a) and helicase (b) showing the relationship of the novel bat coronaviruses to known coronaviruses. The trees were inferred from 
amino acid sequence data (949 amino acid positions for RNA-dependent RNA polymerase and 609 amino acid positions for helicase) by the neighbor-joining method. 
Numbers at nodes indicated levels of bootstrap support calculated from 1000 trees. The scale bar indicates the estimated number of substitutions per 50 amino acids. 
Only three of the 21 strains of bat-SARS-CoV are shown. GenBank accession numbers are in brackets and names of bats from which the coronaviruses were recovered 
are in square brackets. The 13 strains of the six novel coronaviruses in bats are in bold. HCoV-229E, human coronavirus 229E; PEDV, porcine epidemic diarrhea virus; 
TGEV, porcine transmissible gastroenteritis virus; FeCoV, feline coronavirus; HCoV-NL63, human coronavirus NL63; HCoV-OC43, human coronavirus OC43; 
MHV, murine hepatitis virus; BCoV, bovine coronavirus; CoV-HKUl, coronavirus HKU1; SARS-CoV, SARS coronavirus; bat-SARS-CoV, SARS coronavirus-like 
virus found in bats; IBV, infectious bronchitis virus; MR, Myotis ricketti; RS, Rhinolophus sinicus', MP, Miniopterus pusillus; MM, Miniopterus magnater; TP, 
Tylonycteris pachypus ; PA, Pipistrellus abramus. 
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synonymous substitutions between bat-CoV HKU5-1 and the 
other three strains. In addition, between bat-CoV HKU5-3 and 
the other three strains, there were two nucleotide substitutions, at 


positions 412 and 413, resulting in non-synonymous substitu 
tion [CTT (Leu) —► ACT (Thr)], and one synonymous substi 
tution at position 411 (ACA —► ACT). Between bat-CoV HKU5 


a 


1000 


1000 


1000 


-FeCoV (NC.007025) 

-TGEV (NC_002306) 

-Bat-CoV HKU6-1 (DQ249224) [MR] 


A 


652 


169 


1000 


963 


673 


-PEDV (N_C003436) 

1000 |Bat-CoV HKU2-2 (DQ249213) [RS] 
'Bat-CoV HKU2-1 (DQ249235) [RS] 
1000 |Bat-CoV 61-1 (DQ249232) [MP] 

[Bat-CoV 61-2 (DQ249233) [MP] 
L Bat-CoV 61-3 (DQ249234) (MP) 

Bat-CoV HKU8-1 (DQ249228) [MP| 

—Bat-CoV HKIJ7-1 (DQ249226) [MM] 
-HCoV-NL63 (NC_005831) 


688 


G1 


1000 


1000 


878 


1000 



J 

A 


7 


1000 


839 


K 




-HCoV-229H (NC.002645) 

Bat-CoV HKU4-2 (DQ074652) [TP] 

Bat-CoV HKU4-1 (DQ249214) [TP] 

Bat-CoV HKU4-3 (DQ249215) [TP] 

Bat-CoV HKU4-4 (DQ249216) [TP] 

rBat-CoV HKU5-1 (DQ249217) [PA] 

Bat-CoV HKU5-5 (DQ249221) |PA| 

Bat-CoV HKU5-3 (DQ249219) [PA] 

Bat-CoV HKU5-2 (DQ249218) |PA| 

Bat-SARS-CoV HKU3-3 (DQ084200) [RS] ^ 
Bat-SARS-CoV HKU3-2 (DQ084199) |RS] 
Bat-SARS-CoV HKU3-1 (DQ022305) [RS] 
l_rBat-SARS-CoV Rp3 (DQ071615) 

■-SARS-CoV (NC 004718) 
j—BCoV (NC.003045) 

1 -HCoV-OC43 (AY585229) 

—MHV (NC_001846) 

-CoV-HKUl (NC_006577) 


G2c 


G2b 


■IBV(NC_001451) 


] 


G2a 

G3 


1000 


1000 


900 


1000 


981 


525 


1000 


1000 


348 


982 


1000 


1000 


894 


909 


1000 


1000 


586 


1000 


000 


0.02 


-TGEV (NC_002306) 

-Bat-CoV HKU6-1 (DQ249247) [MR] 

-PEDV (NC.003436) 

-Bat-CoV 61-2 (DQ249250) |MP] 

(Bat-CoV 61-3 (DQ249251) [MP] 

Bat-CoV 61-1 (DQ364568) [MP| 

-Bat-CoV HKU8-1 (DQ364567) [MP] 

-Bat-CoV HKU7-1 (DQ249249) [MM] 

_| Bat-CoV HKU2-2 (DQ249237) |RS] 

Bat-CoV HKU2-1 (DQ249236) [RS] 

-HCoV-NL63 (NC.005831) 

-HCoV-229E (NC_002645) 

Bat-CoV HKU4-4 (DQ249241) I TP] 
Bat-CoV HKU4-3 (DQ249240) |TP] 
Bat-CoV HKU4-2 (DQ249239) |TP| 
Bat-CoV HKU4-1 (DQ249238) [TP] 

Bat-CoV IIKU5-2 (DQ249243) [PA] 

Bat-CoV HKU5-5 (DQ249246) [PA] 

L Bat-CoV HKIJ5-3 (DQ249244) [PA] 

Bat-CoV HKU5-1 (DQ249242) |PA] 

|Bat-SARS-CoV HKU3-1 (DQ022305) [RS| 
—IBat-SARS-CoV HKU3-3 (DQ084200) [RS] 

1 Bat-SARS-CoV HKU3-2 (DQ084199) [RS] 


A 


G1 


J 

A 


G2c 


J 


_r Bal-SARS-CoV Rp3 (DQ071615) 
^SARS-CoV (NC_004718) 

-MHV (NC_001846) 

-CoV-HKUl (NC.006577) 

rBCoV (NC.003045) 

HCoV-OC43 (AY585229) 


G2b 


G2a 


IBV (NC_001451) 


] G3 
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Table 2 


Comparison of G + C contents and amino acid identities among the RNA dependent RNA polymerases and helicases of the six novel coronaviruses in bats and those of 
other coronavimses 


Coronaviruses a 

G + C 

Features of the RNA dependent RNA polymerases/Helicases of the six novel bat coronaviruses 



contents of 

coronaviruses 

G + C 

Pairwise amino acid identity (%) 






contents 

Bat-CoV HKU2 

Bat-CoV HKU4 

Bat-CoV HKU5 

Bat-CoV HKU6 

Bat-CoV HKU7 

Bat-CoV HKU8 

Group 1 

HCoV-229E 

0.38 

0.38/0.39 

81/81 

58/62 

57/62 

78/78 

80/81 

74/82 

PEDV 

0.42 

0.40/0.42 

78/78 

60/62 

59/63 

88/90 

81/81 

75/82 

TGEV 

0.38 

0.37/0.38 

75/77 

59/61 

59/61 

75/75 

75/73 

70/74 

FeCoV 

0.38 

0.38/- 

76/- 

60/- 

60/- 

75/- 

75/- 

76/- 

HCoV-NL63 

0.34 

0.34/0.35 

79/81 

57/61 

57/62 

79/82 

81/83 

75/83 

Bat-CoV 61-3 

— 

0.39/0.39 

75/81 

63/64 

60/64 

75/82 

79/86 

79/87 

Bat-CoV HKU2 

— 

0.39/0.40 

— 

58/61 

58/62 

78/78 

82/80 

74/81 

Bat-CoV HKU6 

— 

0.39/0.39 

78/78 

59/63 

59/63 

— 

82/80 

76/81 

Bat-CoV HKU7 

— 

0.40/0.40 

82/80 

59/62 

59/62 

82/80 

— 

84/92 

Bat-CoV HKU8 

— 

0.40/0.42 

74/81 

56/61 

56/62 

76/81 

84/92 

— 

Group 2 

CoV-HKUl 

0.32 

0.32/0.33 

55/53 

67/66 

68/66 

57/56 

57/54 

52/54 

HCoV-OC43 

0.37 

0.36/0.38 

56/54 

68/67 

68/67 

57/57 

56/55 

53/55 

MHV 

0.42 

0.39/0.39 

56/55 

68/67 

68/67 

57/57 

57/56 

53/55 

BCoV 

0.37 

0.36/0.38 

56/54 

68/67 

68/67 

58/57 

57/56 

53/55 

SARS-CoV 

0.41 

0.39/0.41 

59/61 

71/70 

71/71 

60/61 

58/60 

55/61 

Bat-SARS-CoV HKU3 

0.41 

0.40/0.40 

59/61 

71/70 

71/71 

60/61 

58/60 

59/61 

(Lau et al., 2005) 
Bat-SARS-CoV 

0.41 

0.40/0.41 

59/61 

71/70 

71/71 

60/61 

58/60 

59/61 

(Li et al., 2005) 
Bat-CoV HKU4 


0.37/0.38 

58/61 


92/93 

59/63 

59/62 

56/61 

Bat-CoV HKU5 

— 

0.41/0.44 

58/62 

92/93 

— 

59/63 

59/62 

56/62 

Group 3 

IBV 

0.38 

0.38/0.37 

57/55 

60/57 

59/58 

56/57 

57/55 

54/56 


a HCoV-229E, human coronavirus 229E; PEDV, porcine epidemic diarrhea virus; TGEV, porcine transmissible gastroenteritis virus; FeCoV, feline coronavirus; 
HCoV-NL63, human coronavims NL63; HCoV-OC43, human coronavirus OC43; MHV, murine hepatitis virus; BCoV, bovine coronavirus; CoV-HKUl, coronavirus 
HKU1; SARS-CoV, SARS coronavirus; bat-SARS-CoV, SARS coronavirus-like virus found in bats; IBV, infectious bronchitis virus. 


2 and the other three strains, there were two non-synonymous 
substitutions, at positions 815 [GCA (Ala) —► GTA (Val)] and 
1435 [AGT (Ser) -► TGT (Cys)]. 

Discussion 

An astonishing diversity of coronaviruses was observed in 
bats in HKSAR. In this study, among the 13 different species 
of bats captured, a total of eight coronaviruses were observed, 
with six being previously undescribed. This diversity of 
coronaviruses in bats could be related to the unique properties 
of this group of mammals. First, bats account for about 980 of 
the 4800 mammalian species recorded in the world. Although 
HKSAR is an urbanized, subtropical city, it has extensive 
natural areas with 52 different species of terrestrial mammals, 
with 40% of the species being bats. This diversity of bat 
species would potentially provide a large number of different 
cell types for different coronaviruses. Second, the ability to fly 
has given bats the opportunity to go almost anywhere, free 
from obstacles to land-based mammals. Bats have been found 
at altitudes of as high as 5000 m. This ability of bats would 
have allowed possible exchange of viruses and/or their genetic 
materials with different kinds of living organisms. Third, the 


different environmental pressures such as food, climates, 
shelters and predators would have provided different selective 
pressures on parasitisation of different coronaviruses in 
different species of bats. Fourth, the habit of roosting gives 
the opportunity to a large number of bats gather together. This 
would have also facilitated exchange of viruses among indi¬ 
vidual bats. Interestingly, the nucleotide sequences of the pol 
genes of bat-CoV HKU4-1, bat-CoV HKU4-2 and bat-CoV 
HKU4-3, collected at the same location (location A, Fig. 1), 
were the same, but different from that of bat-CoV HKU4-4, 
collected at another location (location B, Fig. 1); and the 
nucleotide sequences of the pol and helicase genes of bat-CoV 
HKU5-2, bat-CoV HKU5-3 and bat-CoV HKU5-5, collected at 
the same location (location C, Fig. 1), were also more closely 
related to each other, than to those of bat-CoV HKU5-1, 
collected at another location (location B, Fig. 1). We speculate 
that this was due to circulation of closely related strains of 
coronaviruses among bats found at the same locations. 

From the results of the present study, we speculate that 
coronaviruses found in bats are bat genus/species specific, 
although one bat genus/species may harbor more than one type 
of coronavirus. Previous studies have shown that coronaviruses 
are particularly host specific and coevolved with their hosts, 
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although host-shifting has also been demonstrated (Jonassen et 
al., 2005; Liu etal., 2005; Lai, 1990; Sturman and Holmes, 1983; 
Rest and Mindell, 2003). In this study, a huge variety of 
coronaviruses of groups 1 and 2, but not group 3, are found in 
bats in HKSAR, with the different coronaviruses being present 
in specific genus/species of bats. For SARS-CoV-like viruses, it 
was detected in Chinese horseshoe bats ( Rhinolophus sinicus) in 
our locality (Lau et al., 2005), whereas in some other provinces 
in China, it was detected in some other species of horseshoe bats 
of the same genus {Rhinolophus ferrum equ in urn , Rhinolophus 
macrotis , Rhinolophus pearsoni and Rhinolophus pussilus ) (Li 
et al., 2005). In addition to SARS-CoV-like viruses, one 
additional type of coronavirus (bat-CoV HKU2) was also 
detected in Rhinolophus sinicus in the present study. Similarly, a 
coronavirus that we described previously was detected in species 
of the genus Miniopterus , Miniopterus pusillus in the present 
study and Miniopterus magnater , Miniopterus pusillus and 
Miniopterus schreibersii in the previous study (Poon et al., 
2005), and bat-CoV HKU8 was also found in Miniopterus 
pusillus. Furthermore, bat-CoV HKU4 was detected in lesser 
bamboo bat, bat-CoV HKU5 detected in Japanese pipistrelle, 
bat-CoV HKU6 detected in ricketf s big-footed bat, and bat-CoV 
HKU7 detected in greater bent-winged bats. On the other hand, 
there is no strict association between the group that a bat 
coronavirus belongs to and the corresponding bat genus/species 
that the virus resides, as both groups 1 and 2 coronaviruses can 
be found in Chinese horseshoe bats. This absence of association 
between coronavirus group among groups 1 and 2 coronaviruses 
and the species of the host is also observed in human 
coronaviruses, as humans are hosts to both group 1 
(HCoV229E and HCoV-NL63) and group 2 (HCoV229E, 
CoV-HKUl and SARS-CoV) coronaviruses. A larger scale 
study expanded to different regions will confirm this phenom¬ 
enon of host specificity. The availability of genomic and 
mitochondrial sequence data on the bat species that harbor the 
coronaviruses discovered in the present study will enable 
analysis of the phylogenetic relationship of these viruses in 
relation to the phylogeny of their hosts. 

Strains of bat-CoV HKU4 and bat-CoV HKU5 may 
constitute a new subgroup, group 2c, of group 2 coronaviruses. 
Using molecular and serological approaches, coronaviruses are 
divided into three groups. When SARS-CoV was discovered and 
its genome completely sequenced, it was first proposed that 
SARS-CoV belonged to “a fourth group” of coronavirus (Marra 
et al., 2003; Rota et al., 2003). However, using multiple 
approaches, including both genomic and proteomic approaches, 
it was confirmed that SARS-CoV probably is an early split-off 
from the group 2 coronavirus lineage (Eickmann et al., 2003; 
Snijder et al., 2003). It is now the consensus that SARS-CoV and 
the SARS-CoV-like viruses belong to a subgroup, group 2b, of 
group 2 coronaviruses, whereas the other group 2 coronaviruses, 
including murine hepatitis virus, bovine coronavirus, HCoV- 
OC43, and the recently discovered CoV-HKUl, belong to group 
2a. In the present study, phylogenetic analysis of two 
independent genes, pol and helicase genes, showed that the 
four strains of bat-CoV HKU4 and four strains of bat-CoV 
HKU5 were clustered and formed a unique lineage within group 


2 coronaviruses, distinct from the group 2a and group 2b 
coronaviruses. The results were in line with those obtained from 
partial sequencing of about 200 nucleotides of the nucleocapsid 
genes of these two coronaviruses (unpublished data). Moreover, 
these two novel coronaviruses appear to be more closely related 
to group 2b than group 2a coronaviruses in both phylogenetic 
trees. Complete genome sequencing will confirm whether they 
represent a novel subgroup in group 2 coronaviruses, are more 
related to group 2a (e.g. presence of hemagglutinin esterase 
gene) or group 2b coronaviruses (e.g. presence of unique domain 
in nsp3), and have evolved as a result of recombination. 
Interestingly, the bootstrap values were particularly low at some 
nodes in the group 1 coronaviruses (Fig. 2), implying the 
phylogenetic relationships among some members of this group 
remained unclear. Unfortunately, the inherent difficulty in 
cultivating coronaviruses, the low viral load in the small 
volumes of samples from these small mammals, together with 
the huge differences in their nucleotide sequences from known 
coronaviruses, have caused major difficulties in achieving 
complete genome sequencing on the novel coronaviruses at 
this stage. 

Materials and methods 

Bat surveillance and sample collection 

The study was approved by the Department of Agriculture, 
Fisheries and Conservation, HKSAR; and Committee on the 
Use of Live Animals in Teaching and Research, The University 
of Hong Kong. Three hundred and nine bats, of 13 species, were 
captured from 20 different locations in rural areas of HKSAR, 
including water tunnels, abandoned mines, sea caves and 
forested areas, over a 16-month period (April 2004-July 2005) 
(Fig. 1, Table 1). Nasopharyngeal and anal swabs were collected 
by a veterinary surgeon. Swabs were taken with sterile swabs 
and kept in viral transport medium at 4 °C before processing 
(Poon et al., 2005). 

RNA extraction 

Viral RNA was extracted from the nasopharyngeal and anal 
swabs using QIAamp Viral RNA Mini Kit (QIAgen, Hilden, 
Germany). The RNA pellet was resuspended in 10 pi of DNase- 
free, RNase-free double-distilled water and was used as the 
template for RT-PCR. 

RT-PCR of pol gene of coronaviruses using conserved primers 
and DNA sequencing 

Coronavirus screening was performed by amplifying a 440- 
bp fragment of the RNA dependent RNA polymerase {pol ) gene 
of coronaviruses using conserved primers (5'-GGTTGGGAC- 
TATCCTAAGTGTGA-3' and 5 '-CCATCATC AGATAGAAT- 
CATCATA-3') designed by multiple alignment of the nucleotide 
sequences of available pol genes of known coronaviruses (Woo 
et al., 2005c). Reverse transcription was performed using the 
Superscript II kit (Invitrogen, San Diego, CA, USA). The PCR 
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mixture (50 pi) contained cDNA, PCR buffer (10 mM Tris-HCl 
pH 8.3, 50 mM KC1, 3 mM MgCl 2 and 0.01% gelatin), 200 pM 
of each dNTPs and 1.0 U Taq polymerase (Boehringer 
Mannheim, Germany). The mixtures were amplified in 40 
cycles of 94 °C for 1 min, 48 °C for 1 min and 72 °C for 1 min 
and a final extension at 72 °C for 10 min in an automated 
thermal cycler (Perkin-Elmer Cetus, Gouda, The Netherlands). 
Standard precautions were taken to avoid PCR contamination 
and no false-positive was observed in negative controls. 

The PCR products were gel-purified using the QIAquick gel 
extraction kit (QIAgen, Hilden, Germany). Both strands of the 
PCR products were sequenced twice with an ABI Prism 
3700 DNA Analyzer (Applied Biosystems, Foster City, CA, 
USA), using the two PCR primers. The sequences of the PCR 
products were compared with known sequences of the pol genes 
of coronaviruses in the GenBank database. 

Viral culture 

One to two samples of each novel coronavirus were cultured 
on LLC-Mk2 (rhesus monkey kidney), MRC-5 (human lung 
fibroblast), FRhK-4 (rhesus monkey kidney), Huh-7.5 (human 
hepatoma), Vero E6 (African green monkey kidney), and HRT- 
18 (colorectal adenocarcinoma) cells. 

RT-PCR and sequencing of the complete pol and helicase 
genes of coronaviruses in bats 

The complete pol and helicase genes of the coronavimses 
identified from bats were amplified and sequenced using different 
sets of degenerate primers designed by multiple alignment of the 
pol and helicase genes of the various groups of coronaviruses and 
additional primers designed from the results of the first and 
subsequent rounds of sequencing. Detailed primer sequences will 
be provided on request. Sequences were assembled and manually 
edited to produce the final sequences of the pol and helicase 
genes. The nucleotide and the deduced amino acid sequences of 
the pol and helicase genes were compared to those of other 
coronavimses. Phylogenetic tree constmction was performed by 
using the neighbor-joining method with GrowTree software 
(Genetics Computer Group, Madison, WI). 

Nucleotide sequence accession numbers 

The nucleotide sequences of the six novel coronavimses 
identified from 13 bats in the present study have been lodged 
within the GenBank sequence database under accession 
numbers DQ249213-DQ249219, DQ249221, DQ249224, 
DQ249226, DQ249228, DQ249235 and DQ074652 for pol 
genes and DQ249236-DQ249244, DQ249246, DQ249247, 
DQ249249 and DQ364567 for helicase genes respectively. 
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